Cross-view scene image localization with Triplet Network integrating NetVLAD and Fully Connected Layers
نویسندگان
چکیده
ç ç©¶åºæ¯å¾åçå°çå®ä½é®é¢å¨å®¤å¤å®ä½ãç®æ æå¯»ãåäºä¾¦å¯çé¢åå ·æéè¦æä¹ãéå¯¹è¡æ¯å½±åä¸é¸ç°å½±åä¹é´ç交åè§è§åºæ¯å¾åå¹é ä¸å®ä½é®é¢ï¼æ¬ææåºäºä¸ç§èåå¯è®ç»å±é¨èéæè¿°ååéNetVLADï¼Net Vector of locally aggregated descriptorsï¼åå ¨è¿æ¥å±çä¸å ç¥ç»ç½ç»ï¼Triplet Networkï¼å®ä½æ¹æ³ï¼Tri-NetVLADï¼ãä¸å ç¥ç»ç½ç»ç±ä¸ç»å·ç§¯ç¥ç»ç½ç»CNNï¼Convolutional Neural Networksï¼ææï¼è½åæ¶å¤ç3å¼ å½±åï¼éè¿å¢å¤§ä¸å¹é å对é´çè·ç¦»ï¼åå°å¹é å对é´çè·ç¦»ï¼å®ç°å¾åæ£ç´¢ä¸å¹é ï¼NetVLADåå ¨è¿æ¥å±çèåå¯ä»¥å 强ç¹å¾é´çå ³èæ§ãæ¬æå°CNNæåçå±é¨å·ç§¯ç¹å¾åå«éè¿NetVLADå±åå ¨è¿æ¥å±å¾å°å ¨å±æè¿°ç¬¦ä¸ç¹å¾åéï¼å¹¶å°äºè èåï¼ææå°æåäºå±é¨ç¹å¾é´çå ³èæ§ï¼å¹¶ä¿çäºä¸åå±é¨ç¹å¾ä¹é´ç差弿§ï¼æåäºæ¨¡åçå®ä½ç²¾åº¦ï¼æ¹è¿äºDBL lossï¼Distance-based layer lossï¼ï¼éè¿å å ¥åæ°Î»å¢å¼ºå½æ°å¤å«å°é¾æ ·æ¬çè½åï¼å¨æå模åçæ¶æé度åç¨³å®æ§ç忶乿åäºæ¨¡åçå®ä½ç²¾åº¦ãå¨ç¾å½Vo and Hayså ¬å¼æ°æ®éä¸çå®éªç»æè¡¨æï¼Tri-NetVLADåå¾äºä¼äºMCVPlacesãTriplet eDBL-NetåCVM-Netçç°ææ¹æ³çå®ä½ç²¾åº¦ï¼å¨æµè¯éä¸ç精度é«äº63%ã
منابع مشابه
Estimation of Network Reliability for a Fully Connected Network with Unreliable Nodes and Unreliable Edges using Neuro Optimization
In this paper it is tried to estimate the reliability of a fully connected network of some unreliable nodes and unreliable connections (edges) between them. The proliferation of electronic messaging has been witnessed during the last few years. The acute problem of node failure and connection failure is frequently encountered in communication through various types of networks. We know that a ne...
متن کاملTransitioning Between Convolutional and Fully Connected Layers in Neural Networks
Digital pathology has advanced substantially over the last decade however tumor localization continues to be a challenging problem due to highly complex patterns and textures in the underlying tissue bed. The use of convolutional neural networks (CNNs) to analyze such complex images has been well adopted in digital pathology. However in recent years, the architecture of CNNs have altered with t...
متن کاملScalable Scene Reconstruction and Image Based Localization
In this thesis two fundamental problems in computer vision are addressed: robust and scalable structure from motion and efficient localization from images. These two problems are highly interrelated tasks with several industrial applications, like mapping, navigation and augmented reality. The main contribution of this thesis is in building a complete, robust and scalable image based reconstruc...
متن کاملIn Defense of Fully Connected Layers in Visual Representation Transfer
Pre-trained convolutional neural network (CNN) models have been widely applied in many computer vision tasks, especially in transfer learning tasks. In transfer learning, the target domain may be in a different feature space or follow a different data distribution, compared to the source domain. In CNN transfer tasks, we often transfer visual representations from a source domain (e.g., ImageNet...
متن کاملDeep Neural Networks In Fully Connected CRF For Image Labeling With Social Network Metadata
We propose a novel method for predicting image labels by fusing image content descriptors with the social media context of each image. An image uploaded to a social media site such as Flickr often has meaningful, associated information, such as comments and other images the user has uploaded, that is complementary to pixel content and helpful in predicting labels. Prediction challenges such as ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of remote sensing
سال: 2021
ISSN: ['1007-4619', '2095-9494']
DOI: https://doi.org/10.11834/jrs.20210188